CDS

Accession Number TCMCG075C09933
gbkey CDS
Protein Id XP_017972864.1
Location join(27810218..27810540,27811095..27811241,27811410..27811563,27812216..27812351,27812781..27812926,27813596..27813691,27813854..27813906,27814958..27815089,27815773..27815824,27815929..27816048,27816357..27816491,27817015..27817143,27817623..27817700,27817905..27818045,27818313..27818407,27818904..27819100,27819269..27820056,27820434..27820514,27820601..27820666,27820763..27820840,27821540..27821770,27822143..27822274,27822389..27822565,27822785..27822970,27823431..27823816,27824524..27824638)
Gene LOC18605417
GeneID 18605417
Organism Theobroma cacao
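
The Location field above uses the GenBank-style join(...) syntax, where each start..end pair is one coding segment. The following is a minimal sketch, assuming 1-based inclusive coordinates (the GenBank/INSDC convention), that sums the segment lengths and compares the result with the protein length given in this record. The helper name `parse_join` is hypothetical and only for illustration.

```python
import re

# Location string copied verbatim from the Location field of this record.
LOCATION = (
    "join(27810218..27810540,27811095..27811241,27811410..27811563,"
    "27812216..27812351,27812781..27812926,27813596..27813691,"
    "27813854..27813906,27814958..27815089,27815773..27815824,"
    "27815929..27816048,27816357..27816491,27817015..27817143,"
    "27817623..27817700,27817905..27818045,27818313..27818407,"
    "27818904..27819100,27819269..27820056,27820434..27820514,"
    "27820601..27820666,27820763..27820840,27821540..27821770,"
    "27822143..27822274,27822389..27822565,27822785..27822970,"
    "27823431..27823816,27824524..27824638)"
)


def parse_join(location):
    """Return (start, end) integer pairs found in a join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


segments = parse_join(LOCATION)

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
cds_length = sum(end - start + 1 for start, end in segments)
codons, remainder = divmod(cds_length, 3)

print(len(segments), "segments,", cds_length, "nt")
print(codons - 1, "amino acids plus one stop codon; remainder", remainder)
```

Under that assumption the 26 segments sum to 4374 nt, which is 1458 codons: 1457 amino acids plus a stop codon, in agreement with the Length field of the Protein block.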

Protein

Length 1457aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018117375.1
Definition PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category A
Description Cleavage and polyadenylation specificity factor
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko00001
ko03019
ko03021
KEGG_ko ko:K14401
EC -
KEGG_Pathway ko03015
map03015
GOs GO:0003674
GO:0003676
GO:0003723
GO:0003729
GO:0005488
GO:0005575
GO:0005622
GO:0005623
GO:0005634
GO:0005737
GO:0005829
GO:0006139
GO:0006378
GO:0006379
GO:0006396
GO:0006397
GO:0006725
GO:0006807
GO:0008150
GO:0008152
GO:0009987
GO:0010467
GO:0016070
GO:0016071
GO:0031123
GO:0031124
GO:0034641
GO:0043170
GO:0043226
GO:0043227
GO:0043229
GO:0043231
GO:0043631
GO:0044237
GO:0044238
GO:0044424
GO:0044444
GO:0044464
GO:0046483
GO:0071704
GO:0090304
GO:0090305
GO:0090501
GO:0097159
GO:1901360
GO:1901363

Sequence

CDS:  
ATGAGCTACGCGGCCTACAAGATGATGCACTGGCCCACCGGAATCGATAACTGCGCCTCCGGCTTCGTCACCCACTGCCGTGCGGATTTCACGCCTCAAATTCCGCTCAACCAGACTGAAGATCTTGAATCTGAATGGCCCGCAAGGCGCGGCATTGGTCCCGTTCCGAACCTCATCGTCACCGCTGCTAACCTTCTCGAAATCTATGTGGTCAGGGTTCAAGAAGAGGGCAGAAGGGAAGCCAGAAACTCCACTGAAGTCAAGCGCGGCGGGGTTTTGGATGGGGTATCCAGGGTTTCTCTCGAGCTCGTTTGCAATTATAGGTTACATGGTAATGTTGAATCTATGGCGGTACTATCTATAGGAGGTGGTGATGGCTCTAGGAGGAGAGATTCAATTATCTTAGCCTTTCAAGATGCCAAAATTTCGGTCCTGGAGTTTGATGATTCCATCCATGGTCTTCGAACAACCTCAATGCATTGCTTTGAGGGCCCGGAGTGGCTTCATTTGAAAAGAGGAAGAGAATCATTTGCTAGAGGGCCACTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGCGGCGTTCTTGTTTATGATTTGCAAATGATAATACTTAAAGCTTCTCAGGCTGGTTCTGGATTTGTGGGAGAGGATGATGCTTTTGGATCTGGAGGTGCAGTTTCTGCTCGTGTTGAGTCATCTTACATTATCAATTTACGAGATTTGGACGTGAAGCACATTAAAGATTTTATATTTGTGCATGGGTATATTGAGCCGGTGATGGTTATCCTTCATGAGCGGGAGCTTACTTGGGCTGGGCGAGTCTCTTGGAAGCACCATACTTGCATGATTTCTGCACTTAGTATTAGCACAACCTTGAAGCAGCATCCTCTCATATGGTCAGCAGTTAATCTTCCTCATGATGCTTACAAGCTGCTTGCAGTTCCGTCGCCAATTGGAGGTGTTCTTGTGATTAGTGCAAATACTATTCATTATCATAGTCAGTCGGCTTCATGCGCACTTGCTTTGAACAATTATGCTATTTCTGTTGATAACAGTCAAGACCTTCCAAGATCAAATTTCAGTGTAGAACTTGATGCTGCTAATGCAACTTGGTTACTAAATGATGTAGCCTTGCTATCAACAAAGACTGGAGAACTGTTATTGCTGACCCTTATTTATGATGGGCGGGTTGTGCAGAGACTTGATCTTTCCAAGTCCAAGGCTTCAGTACTTACTTCGGACATTACAACTATTGGAAATTCATTGTTCTTTTTGGGTAGTCGATTGGGAGATAGTTTGCTTGTGCAATTCAGTGGTGGATCAGGAGCGTCAGCCTTGCCATCTGGTTTGAAGGAAGAGGTTGGAGATATTGAAGGTGATGTCCCTCTGGCAAAGCGATTGCGAAGGTCATCTTCTGATGCTTTGCAAGATATGGTTGGCGGTGAAGAGCTTTCTTTGTATGGTTCGGCCCCAAATAACACTGAGTCAGCACAGAAGACTTTCTTGTTTGCAGTGAGAGACTCATTAACTAATGTTGGCCCTTTGAAGGACTTCTCATATGGCTTGAGGATTAATGCTGATGTGAATGCAACTGGAATTGCCAAACAAAGTAATTATGAGCTGGTGTGCTGTTCTGGCCATGGAAAGAATGGTGCCCTCTGTGTTCTACGACAGTCAATTCGTCCTGAAATGATTACCGAGGTTGAACTAACTGGTTGTAAAGGAATTTGGACTGTCTACCACAAGAGCACACGCAGTCACAGTGCTGATTTGTCTAAAGTGACTGATGATGATGATGAATATCATGCATATTTGATTATAAGTCTGGAGGCGCGCACCATGGTGCTTGAAACAGCTGATCTTTTGACAGAAGTGACTGAAAGTGTAGACTATTATGTTCAAGGAAGAACAATTGCTGCAGGAAATTTGTTTGGAAGGCGTCGAGTTGTCCAGGTCTATGAACGTGGTGCTCGAATTCTGGATGGTTCTTTTATGACTCA
AGAACTGAGTATTCCATCACCAAACTCTGAATCTAGCCCTGGTTCTGAGAATTCTACAGTAATATCTGTTTCTATTGCTGATCCTTATGTGTTGCTAAGAATGACTGATGGAAGCATTCTTCTCCTTGTTGGAGATCCTGCTACTTGCACTGTTTCTATAAACACTCCAACTGCATTTGAAGGCTCAAAGAAAATGGTATCTGCCTGTACATTGTATCATGATAAAGGTCCAGAGCCATGGCTCCGCAAAGCAAGTACTGATGCGTGGCTTTCCACTGGCGTCGGGGAGTCCATTGACGGTGCTGATGGTGGGCCACATGATCAAGGGGATATATATTGTGTCGTTTGTTATGAGAGTGGTGCTCTTGAAATATTTGATGTGCCAAATTTCAATTGTGTTTTCTCTATGGAAAATTTTTCATCTGGAAGAACCCGCCTTGTTGATGCCTATACACTGGAATCTTCTAAGGATTCTGAGAAAGTGATTAATAAAAGTTCTGAAGAATTGACTGGCCAAGGCAGGAAAGAAAATGTTCAAAACCTGAAGGTTGTTGAGTTGGCCATGCAGAGATGGTCTGCAAATCACAGTCGTCCATTTCTTTTTGGAATATTAACGGATGGAACAATTCTTTGTTATCATGCTTACCTATTTGAAGGTTCAGAAAATGCTTCTAAAGTTGAGGATTCAGTTGTTGCACAAAATTCTGTTGGCTTAAGCAATATTAATGCTTCTAGGCTTAGGAATTTGAGATTTATTCGCATCCCGTTGGATGCTTACACGAGGGAGGAGATGTCAAATGGAACCTTATCCCAAAGGATTACAATTTTTAAGAATATTAGTGGTTATCAAGGGTTCTTCCTCTCTGGTTCAAGACCAGCTTGGTTTATGGTATTCAGAGAACGGCTTCGAGTTCATCCACAGCTATGTGATGGATCTATTGTTGCTTTCACTGTTCTTCATAATGTCAACTGTAATCATGGGTTCATATATGTTACATCGCAGGGTATTCTAAAGATTTGCCAAATCCCATCTGCATCAAACTATGACAACTATTGGCCAGTGCAAAAAATTCCACTAAGGGGCACTCCACATCAAGTGACTTACTTTGCTGAGAGGAATCTTTACCCAATTATAGTTTCAGTTCCTGTTCATAAGCCAGTTAATCAAGTGCTATCTTCATTGGTTGATCAAGAAGTTGGCCATCAGATGGACAATCATAATTTGAGTTCTGATGAGTTGCAACGAACTTATACAGTGGATGAGTTCGAGGTTCGGATTTTGGAACCTGAAAAATCTGGTGGTCCTTGGGAAACTAAGGCAACTATACCAATGCAGAGTTCTGAAAATGCTCTAACTGTGAGAGTGGTCACTCTGTTTAATACCACCACAAAAGAGAATGAATCCCTTTTGGCTATTGGGACAGCTTACATTCAAGGAGAGGATGTTGCTGCTAGAGGACGTGTGATTTTGTGTTCAATTGGAAGGAACACTGATAATCCTCAGAATTTGGTGTCAGAGGTTTATTCAAAGGAACTAAAAGGTGCTATATCTGCTTTAGCCTCCCTTCAAGGTCATCTATTGATAGCTTCTGGTCCGAAAATTATTCTACATAATTGGACTGGTAGTGAGCTGAATGGCATTGCATTTTATGATGCTCCACCATTATATGTTGTGAGCTTAAATATAGTCAAGAATTTTATCCTTCTTGGTGACGTTCACAAGAGCATATACTTTCTCAGTTGGAAGGAACAGGGTGCTCAACTTAGCTTATTGGCCAAGGATTTTGGCTCCCTTGATTGTTTTGCAACAGAATTTTTGATTGACGGAAGTACATTAAGTCTCATGGTTTCAGATGAACAAAAGAATATTCAGATATTTTATTATGCACCGAAGATGTCTGAAAGTTGGAAGGGACAGAAGCTTCTTTCAAGGGCTGAATTCCATGTTGGTGCCCATGTTACTAAGTTCTTGCGGCTCCAGATGCTTTCTACTTCATCTG
ATCGAACCAGTGCCACTGCAGGCTCTGATAAAACCAATCGATTTGCTTTGTTATTTGGTACTCTGGATGGCAGCATTGGCTGTATAGCTCCTCTGGATGAACTCACATTTCGTAGATTGCAGTCACTACAGAAAAAGCTTGTTGATGCAGTTCCACATGTCGCTGGCTTAAACCCAAGGTCTTTTCGGCAGTTCCATTCAAACGGAAAGGCTCATAGGCCCGGTCCTGACAGCATAGTTGACTGTGAGCTGCTTTGCCATTACGAAATGCTTCCATTGGAGGAACAGCTCGATATTGCTCACCAGATTGGAACGACGCGTTCCCAAATTCTCTCTAATTTAAATGATCTTACTCTGGGTACAAGTTTCTTGTGA
Protein:  
MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRGIGPVPNLIVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAVLSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINLRDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRSNFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDDDDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCTVSINTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGDIYCVVCYESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLTLGTSFL
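
As a quick consistency check between the CDS and Protein sequences above, the sketch below translates two short in-frame fragments with the standard genetic code. The function name `translate` and the choice to test only the first 30 and last 27 nucleotides are illustrative assumptions. A full check would translate the entire CDS, for instance with an established sequence library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(nucleotides) - len(nucleotides) % 3  # drop any partial codon
    return "".join(
        CODON_TABLE[nucleotides[i:i + 3]] for i in range(0, usable, 3)
    )


cds_start = "ATGAGCTACGCGGCCTACAAGATGATGCAC"  # first 30 nt of the CDS above
cds_end = "CTTACTCTGGGTACAAGTTTCTTGTGA"       # last 27 nt of the CDS above

print(translate(cds_start))  # MSYAAYKMMH
print(translate(cds_end))    # LTLGTSFL*
```

The two outputs correspond to the first ten and last eight residues of the Protein sequence, followed by the stop codon TGA.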